Designing Grid services for distributed knowledge discovery

نویسندگان

  • Antonio Congiusta
  • Andrea Pugliese
  • Domenico Talia
  • Paolo Trunfio
چکیده

The increasing use of computers in all the areas of human activities is resulting in huge collections of digital data. Databases are common everywhere and are used as repositories of every kind of data. Knowledge discovery techniques and tools are used today to analyze those very large data sets to identify interesting patterns and trends in them. When data is maintained over geographically distributed sites the computational power of distributed and parallel systems can be exploited for knowledge discovery in databases. In this scenario the Grid can provide an effective computational support for distributed knowledge discovery on large data sets. To this purpose we designed a system called Knowledge Grid. This paper describes the Knowledge Grid architecture and discusses some related systems and models recently proposed for knowledge discovery on Grids. The paper shows also how to design and implement distributed knowledge discovery services, according to the OGSA model, by using the Knowledge Grid environment starting from searching Grid resources, composing software and data elements, and executing the resulting application on a Grid.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing data analysis services in the Knowledge Grid

Grid environments were originally designed for dealing with problems involving compute-intensive applications. Today, however, grids enlarged their horizon as they are going to manage large amounts of data and run business applications supporting consumers and end users. To face these new challenges, grids must support adaptive data management and data analysis applications by offering resource...

متن کامل

Distributed data mining services leveraging WSRF

The continuous increase of data volumes available from many sources raises new challenges for their effective understanding. Knowledge discovery in large data repositories involves processes and activities that are computational intensive, collaborative, and distributed in nature. The Grid is a profitable infrastructure that can be effectively exploited for handling distributed data mining and ...

متن کامل

KNOWLEDGE GRID : High Performance Knowledge Discovery Services on the Grid

Knowledge discovery tools and techniques are used in an increasing number of scientific and commercial areas for the analysis of large data sets. When large data repositories are coupled with geographic distribution of data, users and systems, it is necessary to combine different technologies for implementing high-performance distributed knowledge discovery systems. On the other hand, computati...

متن کامل

Knowledge Discovery on the Grid

In the last few decades, Grid technologies have emerged as an important area in parallel and distributed computing. The Grid can be seen as a computational and large-scale support, and even in some cases as a high-performance support. In recent years, the data mining community have been increasingly using Grid facilities to store, share, manage and mine large-scale data-driven applications. Ind...

متن کامل

How Distributed Data Mining Tasks can Thrive as Services on Grids

Through a service-based approach it is possible to define services for supporting distributed and pervasive business intelligence applications in Grids. Those services can address all the tasks needed in data mining and in knowledge discovery processes starting from data selection and transport, to data analysis, knowledge models representation and visualization. By exploiting the Grid services...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Web Intelligence and Agent Systems

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2003